EXTREMAL PROBABILISTIC PROBLEMS AND HOTELLING'S T 2 TEST 

UNDER SYMMETRY CONDITION 1 

by Iosif Pinelis 



We consider Hotelling's T 2 statistic for an arbitrary d-dimensional sample. 
If the sampling is not too deterministic or inhomogeneous, then under zero 
means hypothesis, T 2 tends to x\ in distribution. We are showing that a test 
for the orthant symmetry condition introduced by Efron can be constructed 
which does not essentially differ from the one based on x\ and at the same 
time is applicable not only for large random homogeneous samples but for all 
multidimensional samples without exceptions. The main assertions have the 
form of inequalities, not that of limit theorems; these inequalities are exact 
representing the solutions to certain extremal problems. Let us also mention 
an auxiliary result which itself may be of interest: Xd — (d — 1) 5 decreases 
in distribution in d to its limit N(0, i). 

1. Introduction 

Efron (1969), Eaton and Efron (1970) discovered that the Hotelling's statistic 
(1.1) T 2 = XC~ 1 X T 

possesses a strong conservativeness property; here and further X is the sample mean, C is 
the sample covariance matrix. 

To describe it, put 

T 2 



(1.2) R z 



1 + T 2 ' 



Under some natural conditions, including zero means, both nT 2 and nR 2 tend in distri- 
bution to Xd as n ~ > 00 ' where n is the volume of the sample, d is the dimension of the 
sample space. 

Efron (1969), for d = 1, Eaton and Efron (1970), for all d, proved that 
(1-3) Ef(n*R) < Ef(xd) 
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for f(u) = -u 2m , m = 1,2,...; the only requirement they put on the sample was the 
so-called orthant symmetry condition (we should not like to discuss now extensions of 
definition (1.1) applicable to an arbitrary, not necessarily normal sample when C~ l can 
not exist). 

Our main result here in the paper is that (1.3) holds for every convex function / belonging 
to the class C^ onv of all even functions having convex second derivative; besides, the 
inequality 

(1.4) P(n*R > x) < c- P(xd > x), x>0, 

is being extracted from (1.3), with the best possible constant c = 2e 3 /9. 

The factor 2e 3 /9 can be found in Eaton (1974). Earlier, it was proved in Eaton (1970) 
that 

(1.5) E/(5„) < E/( X i), 

where S n = e\X\ + • • • + e n x n , x\ + • • • + x\ = 1, e±, . . . , e n are independent identically 
distributed (i.i.d.) random variables (r.v.'s) with P(e$ = 1) = P(q = —1) = |, i = 
1, . . . , n; / belongs to a certain subclass of the class F of all differentiable even functions / 
such that [f'{t + A) + f'(t — A)] ft is non-decreasing in t > for each real A. Then, based 
on (1.5), Eaton (1974) obtained the inequality 

P 5 n > x )< ^-^e 1 - — , x > 35, 

9 x \ x z J 

i 2 

with <p(x) = (27r)~2e _: V, and stated the conjecture that 

P(Sn > x) <^m, x>2i . 

9 x 



In this paper, we are proving that (1.5) holds for all / of the class F (which actually 
coincides with C^ onv ) and that 

(1.6) PlSn^Xya-^Xy^ 

for all x > 0, where 4>(x) = f_ oo cp(u)du. 

Moreover, we give analogues of (1.5), (1.6) for definite quadratic forms in ei,...,e n 
(instead of linear forms such as S n ) to which (1.3), (1.4) are simple corollaries. 
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In section 2 below, we give our variants of strict definitions and also representations of 
T 2 and R 2 for any multidimensional sample. 

In section 3, we reproduce the orthant symmetry condition by Efron. 

In section 4, we present probabilistic problems related to (1.5), (1.6). 

In section 5, based on the monotonicity of the likelihood ratio, it is shown that 

Xd — (d — 1) 2 decreases in distribution to its limit iV (0, -|) when d f oo. 

In section 6, the results of the preceding sections are used to obtain (1.3), (1.4); the size 
of the corresponding confidence region is discussing. 

Section 7, Appendix, contains some proofs. 

2. Hotelling's T 2 statistic for an arbitrary distributed multidimensional sample. 

Let Xl, . . . , X n be a sample such as in the title of the section, Xi e M d ; we identify R d 
with M ld ; M nd denotes the set of all n x d real matrices. Consider 

n n 

X = ^Xi/n, S = ^XfXi/n, C = S -ifx, 
i=i i=i 

i.e., the sample mean, the matrix of second sample moments and the sample covariance 
matrix, resp.; T means transposition. 

If e.g. the sample is normally distributed, then C -1 exists with probability (w.p.) 1, 
and T 2 statistic is defined as 

(2.1) (n-l)XC- 1 X T . 

In general, it may be defined as in Eaton and Efron (1970): 

(2.2) T 2 = cot 2 0, 
where 6 is the angle between vector 

(2.3) z/ = (l,...,l) T GM nl 
and the linear hull L(X) of the columns of the matrix 

x = (x?...xZ) T , 
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iih line of which is Xi (it is convenient to omit the factor n — 1 in (2.1), which corresponds 
to (1.1)). It is also reasonable to define another statistic, 

(2.4) R 2 = cos 2 6, 

i.e., 

( 2 -5) R 2 = Y+T^' 

assuming that R 2 = 1 T 2 = oo (to specify (1.2)). 

We suggest a somewhat different approach to the general-case definition of T 2 statistic, 
closer to (1.1). It leads to the same notions as in (2.2), (2.4). Namely, put 

(2.6) T 2 = limX(C + eI)~ 1 X T , 

ej.0 



(2.7) R 2 = limX(S + eI)- 1 X T , 

ej.0 

where / is the unit matrix; the limits here always exist, finite or infinite. (Replacing 
here / by any strictly positively definite symmetric matrix (p.d.m.), one would obtain the 
equivalent definitions since the matrix function A — > A -1 is monotone on the set of all 
p.d.m.'s; see Marshall and Olkin (1979)). Indeed, considering 

T 2 = XC^X 7 \ R 2 = XS~ l X T , C e = C + eI, S c = S + el, 

we have 

T 2 R 2 = XC^jfxS- 1 !^ 

= XC-\S £ - C^S^X* = T 2 - Rl 

which implies (2.5) when using definitions (2.6), (2.7). 
Consider the matrix 

(2.8) p = x{x T xyx T 

of the orthoprojector from M n \ onto L(X); A~ denotes a (7-inverse matrix for A (Rao 
(1965)). Taking into account equalities X = v T X/n, S = X T X/n and "quasispectral" 
representation for X (Rao (1965), Ch. 1, appendix 6.1), it can be seen that 

2 ^_^T 



(2.9) R l = XS~X , 
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which is equivalent to 

(2.10) nR 2 = v T Pv, 

i.e., definition (2.7) coincides with (2.4), and, in view of (2.5), definitions (2.2) and (2.6) 
are equivalent too. 

In contrast to (2.9), the equality T 2 = XC~ X is not always true; the simplest reason 
is that the right-hand side of it cannot be infinite. 

3. Orthant symmetry. 

This condition was defined in Efron (1969), Eaton and Efron (1970) as 

(3.1) (Xl, . . . , X n ) = (eiXi, . . . , e n X n ) 
or, equivalent ly, 

(3.2) X = A £ X, 

where = means equality in distribution, A e = diag{ei, . . . , e n } is the diagonal matrix with 
ei, . . . , e n on its diagonal, ei, . . . , e n are i.i.d.r.v.'s independent of X 

P( ei = ±l) = i i = l,...,n. 

It was mentioned in Eaton and Efron (1970) that under the orthant symmetry condition, 

(3.3) nR 2 = e T Pe, 

where e = (ei, . . . , e n ) T . It may be also deduced from (2.8), (2.10), (3.2) and the equalities 
e = A e z/, A 2 = I. 



4. Some extremal probability problems. 

Let / be a locally bounded Borel even function on R; A e M nn ; A > 0, i.e., A is 
a nonnegatively definite matrix; eo, ei, . . . , e n , ^o 5 ^i 5 • • • are independent r.v.'s; e = 
(ei, . . . , e n ) T as above; ^ = (£i, . . . , £ n ) T ; and ^ are symmetrically distributed, P(e* = 
1) = |, E^ 2 = 1, z = 0,1,..., n; does not coincide in distribution with eo; x = 

(x u ...,x n ) T e M nl . 
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In section 1, there were mentioned the class C 2 onv of all real even functions / on R having 
finite and convex (and hence continuous) second derivative /" and the class F of all real 
differentiable even functions / on R such that [f'(t + A) + f'(t — A)]/t is non-decreasing 
in t > for each A G R. 

Theorem 4.1. The following statements are mutually equivalent. 

(i) Ef((e T Ae)^)<Ef((eAO^At; 

(ii) E/(eiXi H h t n x n ) < E/(^xi H h £, n x n )\/x, £; 

(iii) / G C 2 conv - 

(iv) feF- 

(v) g cJ G C c 2 on „Vc > 0, where g cJ (u) = f((u 2 + c)*)); 

(vi) 3^o bounded w.p. 1 Va, 6 G R 

(4.1) E/(a + 6e ) <E/(a + 6^ ); 

(vii) V^oVa, 6 G R (4.1) holds; 

(viii) 3£o bounded w.p.l 

(4.2) Ef( m +be ) <E/(7yo + ^o) 

for every random variable ?yo bounded w.p.l and independent of eo,£o- 

(Boundedness of & is required nowhere but in (vi), (viii). Of course, some expectations 
may be infinite.) 

Proof. See Appendix. 

Define the class C 2 conv of all the functions / G C 2 onv which are convex on [0, oo) or, 
equivalently, are non-decreasing on [0, oo) and, besides, are nonnegative; in other words, 
/ G C 2 onv and f(0) > 0, f"(0) > 0; the additional requirement of being nonnegative does 
not diminish the generality of following corollaries 4.2 - 4.4, but we do need it in corollaries 
4.5, 4.6 and in some places further. 

Corollary 4.2. If / G C\ conv , then VA,£,z 

E/((7fA#) < E/((£ T A#), 

E/(?7iXi H V 7] n X n ) < E/(^iXi H h Cn^n), 



where r\ = (771, . . . , ?? n ) T , rji are independent r.v.'s, = 0, \rfi\ < 1 w.p.l. 
Proof. See Appendix. 

Corollary 4.3. If P is an orthoprojector matrix, then 

Ef((e T Pe^)<Ef( Xr ), 

where r = rank P, Xr = (Xr)^> Xr is a r - v - with distribution, / G C 2 onv . If, moreover, 
/ G C| com; , then one can substitute here 77 for e. 

Proof. It suffices to note that £ T P£ = xl if & ~ AT(0, !)• 

This corollary in the case f(u) = -u 2m , m = 1, 2, . . . , was proved, in essential, in (Eaton 
and Efron (1970), corollary 6.1). 



Corollary 4.4. If £1 is JV(0, 1) r.v. and x\ H + £ 2 = 1, then V/ G C 2 on „ 

E/(eixi + • • • + e n x n ) < E/(£i). 
If, moreover, / G C 2 conu , then we can substitute 77 for e. 
Proof. Put P = xx T in corollary 4.3. 

This corollary was proved in Eaton (1970, 1974) for / G F (and so, in view of the 
equivalence (iii) <^ (iv) of theorem 4.1, for / G C 2 onv ) but under the additional restriction 
E\f(T n )\ 1+s < M, 5 > 0, T n = e T x or ifx, resp. 

Corollary 4.5. Under the conditions of corollary 4.3, 

(4.3) P(r/ T Pr? > u 2 ) < Q r {u) 

2e 3 

(4.4) < -g-P(-Xr > u), u> 0, 
where 

Q r (u) = inf{Ef( X r)/f(u) : / G C 2 convl /(«) > 0}. 

Proof. See corollary 4.3 for (4.3) and (4.13) below for (4.4). 
Corollary 4.6. Under the conditions of corollary 4.4, 

(4.5) ?(vixi + ■■■ + VnX n > u) <Q 1 (u)/2 

(4.6) <^_p(^> u ), M >o. 



8 



Proof. Put P = xx T in corollary 4.5. 

A statement close to (4.5) was given in Eaton (1974); (4.6) is an improvement of corol- 
laries 1, 2 in Eaton (1974) and of the conjecture following those corollaries therein. 

Let us provide further information on Q r (u) which, in particular, contains (4.4), (4.6). 

Proposition 4.7. 



(4.7) Q r (u) = min[l, r/u 2 , W r (u)} 

1 if 0<«<ri 

(4.8) = < r/u 2 if ri < u < [i 



W r (u) if u > fi 7 



where 



W r (u) = inf{(u - t)- 3 E( X r ~t) 3 + : t G (0, u)}, 

(4.9) n r = Exl/Ex 2 r ; 

besides, 

(4.10) fi r e((r+l)K (r + 2)3). 
Proof. See Appendix. 

Proposition 4.8. Constant 2e 3 /9 = 4.463 ... is the best possible in (4.4), (4.6) in the 
sense that for each r 

2e 3 

(4.11) Q r (u) ~ — P(Xr > u), u -> oo; 

here and in what follows a ~ b means a/b — > 1. 
Proof. See Appendix. 
Consider the ratio 

A r (u) = Q r (u)/P(Xr > U), 

and define 

/•OO 

q = q( u ) = q r (u) = / s^e'" 2 / 2 I{s > 0}ds 

J u 
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so that 

P( X r>u)=q(u)/q(0); 
here and further, I {A} = 1 when A is true, I {A} = otherwise. 

Proposition 4.9. 



(4.12) A r (u) < (2e 3 /9) + 3[J(a u ) - J{S)}, u > fi r , 

(4.13) A r (u) < 2e 3 /9, u > 0, 
where 



a u = 3g(u)g"(u)/g'(u) 2 > 0, 

(4.14) J(a) = 6a" 4 (e a - 1 - a - a 2 /2 - a 3 /6); 

(4.15) a u | 3 (it > /U r , ti t oo); 

(4.16) ^7(a) increases in a > 
(actually in all a, with i7(0) = |, but we need not this improvement). 

Proof. See Appendix. 

Proposition 4.10. Function Ai(it) increases in u > n\. 
Proof. See Appendix. 

The most non-trivial of propositions 4.7 - 4.10 is of course proposition 4.9. It shows 

that A r (ix) < A r (ix), where A r (it) = ^ + 3(J(a u ) - J (3)) T ^f", u > Hr- Apparently, 

A r (u) itself increases in it > \x r for each r (to 2e 3 /9, in view of proposition 4.8), but 
we can prove this fact only for r = 1 (proposition 4.10); a scheme of proof like that of 
proposition 4.10 may do for all r though it becomes too complicated in the general case. 
Nevertheless, proposition 4.9 given here may often be more exact and useful than that 
hypothetic qualitative result. 



5. The monotonicity of a likelihood ratio and a stochastic majorization. 
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Consider the family % r — (r — 1)^, where r is any real number in [l,oo), Xr has the 
density 

(5.1) p r (u) = a« r_1 e"" 2/2 /{« > 0}, 

C r depends only on r; one can see that (r — 1)^ is the mode of Xr- 

We shall show that this family has monotone likelihood ratio and hence is stochastically 
monotone. Let (E, <) be any partially ordered set. 

We say that a family (£ r : r G E) of r.v.'s having densities (p r : re E) has monotone 
likelihood ratio (MLR) if the implication 

r < d, r G E, d e E, -oo < s < t < oo 

(5.2) =>- p r (t)pd(s) > Pd(t)p r (s) 

is true. In the case p r (t) > OVr, t, this definition just means that Pd/p r is non-increasing 
on R when r < d. 

We say that a family of r.v.'s (£ r : r G -E 1 ) with the tails F r (t) = P(£ r > t) has monotone 
tail ratio (MTR) if the implication 

r < d, r G E, d G E, —oo < s < t < oo 

(5.3) F r (t)F d (s) > F d (t)F r (s) 
is true. 

A family (£ r : r G -E 1 ) of r.v.'s is called stochastically monotone (SM) if 

(5.4) r < d, reE, deE, t G R F r (t) > F d (t); 

this definition of the stochastical monotonicity or, in other words, monotonicity in distri- 
bution, is generally accepted (see, e.g., Marshall and Olkin (1979)). 

Proposition 5.1. If (£ r ) has MLR, then it has MTR. 

Proof. If (£ r ) has MLR, and — oo < s < t < oo, then 

F r {t)[F d (s)-F d {t)}= J J p r (v)p d (u)dvdu> 

s<u<t<v 
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> 



// 



p d (v) Pr (u)dvdu = F d (t)[F r (s) - F r (t)}, 



and so F r (t)F d (s) > F d (t)F r (s)(r < d). 
Proposition 5.2. If (£ r ) has MTR, then it is SM. 
Proof. In (5.3), tend s to — oo. 

Theorem 5.3. The family far - (r - 1)* : r > 1} has MLR. 
Proof. Take d > r > 1 and put a = (r — 1)^, 6 = (d — 1)2. Then 



if u > —a, hence p r (t)pd( s ) > Pd(t)p r (s) when —a < s < t; if s < —a, then p r (s) = 0, so 
this case is trivial. 

Corollary 5.4. This family has MTR. 

Proof. See proposition 5.1. 

Corollary 5.5. This family, {x r — (r — 1)^ : r > 1}, is SM. 
Proof. See proposition 5.2. 

Remark 5.6. Consider £ r = % r — (r— 1)^, 1 < r < oo; let £oo have iV (0, -|) distribution. 
Then £ r -> £ oo in distribution when r — > oo, and one may supplement the family in 
statements 5.3 - 5.5 by £oo. 

Remark 5.7. If required additionally that inequality (5.2) should be strict for some 
s, r < d, r 7^ d on a positive-measure set of values t such that t > s, then the conclusions 
of propositions 5.1, 5.2 can be also improved so that inequalities (5.3), (5.4) become strict 
for those values of s,r,d. In particular, statements 5.3 - 5.6 can be improved so that 
inequalities (5.3), (5.4) become strict whenever 1 < r < d < oo, — (d — l)?<s<t< oo, 
for £ r = Xr — (r — 1)^, 1 < r < oo, and £oo having iV (0, |) distribution. E.g., for all 



where <p is N(Q, 1) distribution function; this is an improvement of the central limit theorem 
for Xr- 



(log[p d (u)/Pr («)])' = (r~ d)u 2 /{a + b)(u + a)(u + b)<0 



«6R, d > 1, 



(5.5) 



P(Xd-(d-l)* >u)>l-0(23 U ), 



6. Inequalities for the distribution of T 2 under the orthant symmetry condition. 
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It is known that if X\, . . . ,X n , . . . are i.i.d. random vectors with zero means and a 
finite non-degenerate matrix of second moments, then the distribution of T 2 (and hence 
that of .R 2 ), is close to that of Xdl n when n — > oo. Similarly to Efron (1969), Eaton and 
Efron (1970), we are showing that some kind of conservativeness holds under the orthant 
symmetry condition only; this conservativeness is stronger than that in Efron (1969), Eaton 
and Efron (1970). 

We permanently suppose that the orthant symmetry condition takes place. 
Theorem 6.1. If / G C£ cont> , then 

E/(n*i?) < Ef( Xd ), 

where R = (R 2 )*. (When f(u) = u 2m , m = 1,2,..., this inequality was proved in Eaton 
and Efron (1970)). 

Proof. See (3.3) and corollary 4.3. 

The class C 2 conv is sufficiently large to make it possible to obtain the following, main 
result of this paper. 

Theorem 6.2. For all u > 

(6.1) P(n*R >u)< Q d {u) < ^~P(Xd > u). 

Proof. See (3.3) and corollary 4.5. 

One can find further information about Qd(u) in propositions 4.7 - 4.10. 
Consider now 5-quantiles Xd{S) and Xd{6), < 8 < 1, for R and Xd, resp., i.e., 
P(x d > x d (S)) = 5, x d (5) = inf{x e R : P(n^R > x) < 5}. 
In particular, P(n^R > Xd(5)) > 5 > P(n^R > Xd(S)). 
Theorem 6.3. If S < 0.5, then x d (S) > (d - l)s and 



(6.2) x d {5) < x d {S/c) 

(6.3) < x d (S) + [x d (S) - (d - l)^)]- 1 logc 

(6.4) < x d (5) + [x d (5) -(d- l)^]" 1 logc 

(6.5) <x d (S) + o(l) 

(6.6) < x d (6)(l + d-* -o(l)) 

(6.7) <x d (S)(l + o(l)), 
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where o(l) — > uniformly in d, n, Xi, . . . , X n when 5 | 0; c = 2e 3 /9. 
Proof. See Appendix. 

Thus this theorem means that the size Xd{5) of Hotelling's criterion under the orthant 
symmetry condition can exceed the limit size Xd(S), if can, only by some negligible value. 
It is just that conservativeness property we spoke about. 

Moreover, this theorem implies that the greater the dimension is, the even better the 
situation becomes (the same tendency takes place when 5 decreases). This is illustrated 
by the following numerical data for 5 = 0.05, where x$ stands for Xd(S), z§ = x$ + [x$ — 
(d - l)/^]" 1 • logc. 



d 


1 


2 


5 


10 


20 


50 


oo 


X8 


1.96 


2.45 


3.33 


4.28 


5.61 


8.22 


d^ +1.16 


%6/c 


2.54 


3.00 


3.85 


4.78 


6.10 


8.69 


dh +1.61 


zs 


2.72 


3.18 


4.03 


4.97 


6.28 


8.88 


dh +1.80 



7. Appendix. 

Proof of theorem 4.1. It consists in checking the following implications: 
(vii) =>- (vi) =>- (viii) =>- (hi) =^ (iv) =^ (vii), 
(iii) (v) (i) (ii) (vii). 
Let us proceed. 

(vii) =>- (vi). It is trivial. 

(vi) =>- (viii). Inequality (4.2) is an "average" of (4.1). 

(viii) =>- (iii). For the beginning, suppose that / has continuous 4th derivative 
Then Taylor's expansion gives 

(E/(o + &&) - f(a) - f"(a)b 2 /2) • 246" 4 -> /W(a) • E£ 4 

when 6^0. Using the analogous expansion with eo instead of £o 5 one can see that 
/( 4 )(a)(E£Q — 1) > 0. But E£ 4 > 1 since, as it was fixed, £ does not coincide with 
eo in distribution and E£q = 1. Thus f^ 4 \a) > 0, and so /" is convex. To check now 
the general case, put f m (u) = E/(w + rjo/m), m = 1,2,..., where rjo is a bounded r.v. 
with a sufficiently smooth density. By what has been already proved, fm(u) > ViteR. 
Consider the operator A 3 f(u) = f(u + 3) - 3f(u + 2) + 3f(u + 1) - f(u). Then Vw > 
039 G (0, 3) A 3 f m (u) = C{u + 6) > f%(u) > f%(0) = since /i 4) > and therefore f% 
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is non-decreasing. Note that f m (u) — > f(u) almost everywhere (a.e.). Hence, A 3 f m (u) is 
bounded a.e., and so is f^(u). By Helly's theorem, — > h weakly on each compact in R, 
for some subsequence of the values of m — > oo and some non-decreasing finite function /i. 
Integrating by parts 3 times or, more exactly, using Fubini theorem, it is easy to see that 

(7.1) / G C 2 conv & /(«) = a + 6u 2 /2 + J(\u\- t)\u f (dt)/% 

t>o 

for some real a,b and a a-finite nonnegative measure Vf, where u+ = max(«, 0); besides, 
if / G C 2 conv , then a = f(0),b= /"(0),Vu > 0i//([0,u]) = f"'{u + 0), where /'"(u + 0) 
is the right derivative of convex function f"(u); (7.1) is but a kind of Taylor's expansion. 
Further, 

/ {\u\-t)ldfZ{t)^ I (\u\-tf t dh(t) 

Jt>0 Jt>0 

since — > h weakly on compacts. Hence, 3a, b 

/ m (0)->a, /£(0)->&, 

f(u)=a + bu 2 /2+ [ (\u\ - t)\dh{t); 
Jt>o 

thus, / G C 2 on „. 

(iii) =>- (iv). Essentially, it was proved in (Eaton (1974), Lemma 1); we need only to 
substitute f"(u + 0) for f"(u) in Eaton (1974). 

(iv) =>- (vii). Put g(x) = f(a + bx^) + f(a - bx%), x > 0. Then, / G F g' is 
non-decreasing in x > 0. Therefore, g is convex on [0, oo), and 

2E/(a + be ) = g(l) < Eg^ 2 ) = 2E/(a + 

by Jensen's inequality. 

(v) =>- (iii). Take c = 0. 

(iii) =>- (v). In view of (7.1), it is sufficient to prove that Vt > OVc > the function 
g{u) = (z* — z = u 2 + c, belongs to C 2 onv . Calculations show: 

1) z = t 2 u = u + or u = U-, where u± = ±(t 2 — c)^; here it is necessary that 
t 2 > c; further, 

g"'(u± + 0) - (7 w (w± - 0) = 6t" 3 (t 2 - c)' > 
is t > 0; u± = 0, #'"(0 + 0) - #"'(0 - 0) = 12 > when t = (and, hence, c = 0); 
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2) z < t 2 gW(u) =0; 

3) z > t 2 =>• g {4) (u) = 9cz-^[(z-5t 2 )c + At 2 z] > 9cz~? ■ min(z - t 2 , At 2 ) > since 
< c < z. Thus g"'(u + 0) is non-decreasing in u G M, g G C 2 onv . 

(v) =>- (i). Note that e T Ae = (aei + f3) 2 + c, where c > 0, a, /? do not depend on e\. 
Function 

h(u) = f(((au + (if + c) 1 *) + f(((-au + (if + c)*) 
= g c j(au + (3)+ g c j(-au + (3) 

belongs to C 2 onv as g c ,f(u) does. Thus 

2E/((e T Ae)^) = E/i( ei ) < Eh(^) = 2E/((e t Ae)*), 

where e = (£i, €2, . . . , e n ) T , because, as we have already proved, (v) =>- (iii) =>- (iv) =>- (vii). 
Successively replacing the remaining q's by £j's, we are coming to (i). 

(i) (ii). Put A = xx T . 

(ii) =>- (vii). Take n = 2 and note that 

Ef(a + b£o) = E/(-a + 6£o) = E/(aei + b£ ). 



The theorem is proved. 

Proof of corollary 4.2. If f(u) = bu 2 , b > 0, then the inequalities we are proving are 
evident. In view of (7.1), it suffices to consider the case f(u) = (\u\) — 1)+, t > 0. In this 
case, calculations show that (cf. with the proof of statement (hi)=>- (v) of theorem 4.1) 
g c j is convex on R. Therefore, the function h in the proof of the implication (v)=>- (i) of 
theorem 4.1 is convex. It remains to use the following inequality due to Hunt (1955), see 
also Eaton (1974): 

EG(7/i,...,7/ n ) < EG( ei ,...,e n ), 
if G is a function convex in each its argument. 
In the following proofs we need some auxiliary results. Put 

poo 

1 = 1 (u)=7r(u)=~E(Xr-u)l/C r = / (S- ufp r (s)ds/C r 

J u 

poo 

= (s-u) 3 s r - 1 e- s2 / 2 I{s>0}ds, 

J u 
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where p r is defined by (5.1), and, as in section 4, 

/•OO 

q = q(u) = q r (u) = / p r (s)ds/C r 

J u 

poo 

= / s^e-* 2 / 2 /^ > 0}ds. 

J u 

Lemma 7.1. For all r > 0, j = 0, 1, 2, 3, 4 



(7.2) (-1)V J ' } H > 0, mGI, 

(7.3) (-1)V j) («) ~ 6w r - 5+J e- u2/2 , u -> cx) 

(7.4) 7 ( 3 )(n) = -6 9 ( W ), 

(7.5) 7 (4) (w) = 6u r_1 e- ,i2/2 /{u > 0} = -6g'(u), u ^ 0, 

7 ( 5 )(V) = -6g"(u) = 6[u - (r - l)/u]g'(u) = 

(7.6) = -[u- (r- l)/w] 7 (4) (w), u > 0, 



where ^\u) = {d J / du J )^(u), j = 1, 2, . . . ,7 (0) («) = 7(«). 
Proof. Equalities (7.4) - (7.6) are the results of simple calculations. By L'Hospital's rule, 

<y(u)/(6u r - 5 e- u2/2 ) ~ 7 '(u)/(6[(r - h)u r ~ G - i/" 4 ^"" 2 / 2 ) 

~ -Y(u)/(6u r - 4 e- u2 / 2 ) 7 ( 4) (w)/(6w r - 1 e- u2 / 2 ) = l(u -> oo), 

and so (7.3) is proved. In particular, 7^)(tt) — > 0(it — > oo). This, together with (7.5), 
implies (7.2). 

Lemma 7.2. For each r > 1 

(7.7) g(«)g"(«)/g'(«) 2 | 1 when u > (r — l) 5 , u f oo, 
or, equivalent ly, 

(7.8) (u- (r- l)/w) _1 > -q{u)/q'{u) 
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(7.9) > (u - (r - l)/u)/[{u - (r - l)/u) 2 + 1 + (r - l)/u% u > (r - 1)3. 

Proof. In view of (7.6), inequality (7.9) means that /(it) = q(u)q"(u)/q'(u) 2 increases 
in u > (r — 1)2, and (7.8) means that f(u) < 1. On account of (7.3), (7.4), we see that 
f(u) — > l(u — > oo), and so 

(7.7) (7.8) & (7.9) (7.9). 

Write (7.9) as g(u) > 0, w > (r — 1) 3 , where we put 

(/(u) = q(u) + q'{u){u - (r - l)/u)/[(u - (r - l)/w) 2 + 1 + (r - l)/w 2 ]. 

Taking into account (7.3), (7.6), we see that g{u) — > as u — > oo and 

</'(«) = ?»[(1 + (r - l)/« 2 ) 2 + (u - (r - l)/«)(r - l)/« 3 ] 

x [(u - (r - l)/w) 2 + 1 + (r - l)/w 2 ]" 2 < 0, u > (r - 1)'. 

Hence #(«) > V-u > (r — 1)3, and the proof is completed. 

Proof of proposition 4.7. Using (7.1) and arguments similar to (2.8), (2.9) of Eaton 
(1974), one can obtain (4.7). 

Integration by parts gives, in view of (5.1), 

(7.10) E X i = (r + j- 2)E X r 2 , J > 2 — r. 
Thus, by Schwartz inequality, 

fi r = Ex 3 r /Ex 2 r > (Exl/Exrf* = (r + 1)3, 
^<(E^/E X 2 )^=(r + 2)3, 

so (4.10) is proved. 
Consider the function 

//(*) = t-37(t)M*)- 

We have 

^(t) = (3 7 (t) 7 "(t)-2 7 '(t) 2 )/y(t) 2 

= 2(/3 3 /3 1 -/3 2 )//3 2 >0, 

by Schwartz inequality, where (3i = E(xr — t) l + . In view of (7.3), n(t) — > oo when t — > oo, 
and so 

t <-> it = /U(t) 
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is one-to-one increasing correspondence, under which, in particular, numbers t > corre- 
spond tou> u(0) = Hr (see (4.9)), and vice versa. Put 

F(t, u) = E( Xr - tf + /(u - tf = C rl {t)/{u - tf. 

Then 

(d/dt)F(t, u) = C r (u - t)" 4 (3 7 (t) + (u - t)i{t)) 
= C r (u-t)S'(t)(u- n(t)), t<u, 

and, taking into account that Y(t) < 0, we deduce 

W r {u) = F(^" 1 (-u),w) 
(7.11) = min{F(t, u) : t 6 (- oo,u)}, u>fi r . 

In particular, Vu > [i r 

W r (u) < F(0, u) = u~ 3 E X l < u- 2 E X 3 r /Hr = r/u 2 
< rlnl < 1, 

in view of (4.10), already proved, which implies (4.8) when u > [i r . If, conversely, u < [i r , 
then 

W r (u) = F(0,u) > u- 2 E X l/Hr = r/u 2 , 

and so, by (4.7), Q r (u) = min[l, r/u 2 ]. Thus (4.8) is completely proved, and so is the 
proposition. 

Proof of proposition 4.8. By proposition 4.7, Q r (u) = W r (u) for u > u r . And so (see 

(7.11) ) 

Q r (u) = F(u- l (u),u) = -C r Y(u- 1 (u)) 3 /(27^(u- 1 (u)) 2 ), u > u r . 
Using (7.3), we see that for each r 

QM/nXr > u) ~ (2/9)(^- 1 ( W )^ 2 K- 2 )exp|^ - 

~ (2/9) exp{u(u - /U _1 (w))} ~ 2e 3 /9, u -> oo. 

Proof of proposition 4.9. First, note that assertion (4.15) coincides with (7.7); (4.16) is 
a consequence of Taylor's expansion. 

Then, take for the beginning u > u r . In view of (7.11), 

(7.12) Q r (u) < F(t,u), 
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where 

t = t(u) =u + 3q/q', 
and, as above, q = q(u), q' = q'(u). Taylor's expansion gives 

7 ( T ) = 7 ( u ) - (u - r) 7 '(tt) + (it - t) 2 ^"{u)/2 

-{u- r) 3 7 w (u)/6 + (u - r) 4 / e 3 7 (4 )(r + (u - r)0)e20/6, 

e a = 1 + a + a 2 /2 + a 3 /6 + a 4 / ©V 1 "^©^. 

Jo 

Further, for s < u, 

7 ( 4 )( s )/ 7 (4)( w ) < [1 - (u - s)/u] r - 1 exp{(w 2 - s 2 )/2} 

< exp{(w - s)[-(r - l)/u + (u + s)/2]} < exp{-(w - s)q"/q'} 

= exp{(l - 0)a u }, e = (s-r)/(u-r), a u = 3qq"/q' 2 . 
Thus, 

7(t) < 7(u) - (it - r)7'(u) + (it - t)V(w)/ 2 

- (u - t) 3 7 '»/6 +(u- r) 4 7 (4) («)J(a u )/6, 

F(r, u)/P(xr > «) < 7 • (« - r)" 3 /g - i ■ (u - T)~ 2 /q 

(7.13) + 7" ' (« - r)-V(2g) - 7"7(&z) + 7 (4) • (« - r)J(a u )/(6q). 

Further, 

(7.14) 7 (4) -(«-r)/(6 ? ) = 3, 

(7.15) -7'7(6«) = I- 
Put / 2 = 7" + 6q 2 /q'. Then 

# = 7"' + 12g - 6q 2 q"/q' 2 = 6q(l - qq"/q' 2 ) > 
because of (7.7); hence, f 2 < 0, i.e., 

(7.16) 7 ,/ .(«-r)- 1 /(2g)<l. 
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Put /i = -i - 6q 3 /q' 2 . Then, by (7.7), 

f > = _y _ 18q i/ q > + I2q\"/ q * > - 7 " - G? 2 /?' = -/ 2 > 0; 
hence, /i < 0, i.e., 

(7.17) -{u-r)- 2 i/q<2/Z. 
Put /o = 7 + 6q 4 /q /3 . Then, by (7.7), 

f = y + 24^/c/ 3 - l^V?' 4 > y + 6g 3 /9 /2 = -/i > 0; 
hence, fo < 0, i.e., 

(7.18) (u-r)- 3 <y/q < 2/9. 

Getting now (7.12) - (7.18), (4.15), (4.16) together, we obtain 

A r (u) < 2/9 + 2/3 + 1 + 1 + 3.7(a u ) = 

= 2e 3 /9 + 3[,7(a u ) - J(3)] < 2e 3 /9, u > Hr- 

Thus (4.12) and (4.13) for u > \x r are proved. 
Now consider the case < u < /i r . Then (see (4.7)) 

A r (u) = rC^/ivtqiu)), (u 2 q(u))' = ug{u), 
where we put g(u) = 2q(u) — u r e~ u I 2 . We have 

g '(u) = [u 2 - (r + 2)}u r - 1 e- u2/2 < 
when <u< /z r (< (r + 2) a; see (4.10)). Thus, 

i 2e 3 i 

g{r*) < =^ A r (tt) < A r ( / u r ) < — , < u < fj, r , 

9 

in view of (4.12). If, on the contrary, g(r^) > 0, then 

(7.19) q r (r?) > (r/e) r / 2 /2. 
If u*e(r^ , jii r ) is a root of the equation g{u) = 0, then 

«*?(«*) = h(u*)/2, 
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where we set h{u) = u r+2 e~ u2 / 2 ] but 

h'(u) = -[u 2 - (r + 2)}h{u)/u > 

when < u < /U r (< (r + 2)^); thus u 2 q{u*) > rq(r^); hence 

max{A r (tt) : <u< fi r } = max{A r (r2 ), A r (jU r )}. 

By (4.8), (4.12), A r (p r ) < 2e 3 /9. Let us prove that A r (r%) < 2e 3 /9 or, equivalently, 
g r (0) < (2e 3 /9)(j r (r5) under (7.19). It suffices to show that 

(7.20) ?r (0) < y(r/e)5. 

We shall do it by induction. This is easy when r = 1, 2. If (7.20) is true for some r > 1, 
then 

q r+2 (0)=rq r (0)<r-(e 3 /9)(r/e) r * = 

= (e 3 /9)[r/(r + 2)]^/ 2 ■ e ■ [(r + 2)/e]( r + 2 V 2 < (e 3 /9)[(r + 2)/e]^/ 2 . 
Thus, (4.13) is proved in the case < u < \i r . Consider, finally, < u < r5. Then 

A r (u) = l/P(Xr > m) < A r (r^) < 2e 3 /9 

as we have just shown. So (4.13) and the whole proposition are completely proved. 
Proof of proposition 4.10. Consider 

((u) = Q' 1 (u)/(C 1 q' 1 (u)), u>^, 

where Qi(u) = Wi(u) = F(^" 1 (-u), it) (see (4.8), (7.11)). Let t stand for ^ _1 (w). Calcula- 
tions show that 

Q'iW = -7 /3 /(27 7 2 ), 
(rfM)logC(«) = -3F 2 /(77' 3 )>0, 

2 

in view of (7.2), where F = 7' — 77", 7 = 71 (£), 7' = 7^(t), 7" = "f"(t); moreover, 

((((F/7") / 7 //2 /(7V / ))V /,2 /(7 // 7 (4) )) , 7 (4)2 /(7 (5) 7 /// )) / = e^e"^ 2 > 

when t > 0, which, together with (7.2), (7.3), (7.5), leads to the inequality F > and then 
to 

(d/dt) log C(u) > 0, t > 0. 
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Now, in view of proposition 5.1 and remark 5.7, we obtain the statement of proposition 
4.10. 

Proof of theorem 6.3. Inequality (6.3) is a consequence of (6.1). By (5.5), Xd(S) > {d— l) 5 
when 5 < 0.5. Put f(u) = cq(u + h) — q(u), where 

c = 2e 3 /9, h = h{u) = (logc)/[u - (d - l)/u],u > (d - 1)3. 

Then 

f'{u) = -q'(u) + cq'(u + h) ■ (1 + ti) > -q'(u) + cq'{u + h), 
f'{u)u l - d e u2 ' 2 > 1 - c • (1 + h/uf- 1 exp | ^ - } 
> 1 - c-exp{-[u- (d- l)/u] -h} = 0. 

Hence /(u) < \/u > (d - In view of (6.1), this implies (6.3), (6.4). 

Using again (5.5), we see that if 5 J, 0, then Xd(5) — (d — l)i — > oo; hence we obtain 
(6.5), (6.6). Finally, (6.7) is trivial. 
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